Multi modal multi-semantic image retrieval

نویسنده

  • Kraisak Kesorn
چکیده

.................................................................................................................... viii ACKNOLWEDGEMENTS .................................................................................................... x ABBREVIATIONS ................................................................................................................. xi CHAPTER 1 INTRODUCTION ......................................................................................... 1 1.1 Motivation ....................................................................................................................... 2 1.1.1 Representing Visual Content and Classifying Images ........................................... 3 1.1.2 Ambiguity of Natural Language in Text Captions ................................................ 4 1.1.3 Use of Hybrid Visual and Textual Metadata Models ............................................ 5 1.2 Research Objectives ........................................................................................................ 6 1.3 Structure of this Thesis .................................................................................................... 8 CHAPTER 2 FUNDAMENTALS ...................................................................................... 10 2.1 Main Processes for IMR ................................................................................................ 10 2.2 Content-Based Image Retrieval (CBIR) ........................................................................ 13 2.2.1 Global Features .................................................................................................... 14 2.2.2 Local Features ..................................................................................................... 15 2.3 Semantic-Based Image Retrieval (SBIR) ...................................................................... 16 2.3.1 Knowledge (Ontology-based) Representation Techniques ................................. 18 2.3.2 Advantages of Using Ontologies for IR .............................................................. 21 2.4 MPEG-7 and Ontology-based KB ................................................................................. 21 2.5 Summary ........................................................................................................................ 25 CHAPTER 3 SURVEY AND ANALYSIS OF THE STATE-OF-THE-ART FRAMEWORKS ......................................................................................... 26 3.1 Problem Analysis of the Image Retrieval Systems ........................................................ 26 3.2 Formal Requirements of Image Retrieval Systems ....................................................... 28 3.3 Survey of State of the Art Frameworks ......................................................................... 29 3.3.1 Ontology-Based KB Frameworks for IMR ......................................................... 29 3.3.2 Visual Features-Based Frameworks for Visual Content Representation ............. 34 3.4 Discussion ...................................................................................................................... 39 3.5 Summary ........................................................................................................................ 42

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Public Transport Ontology for Passenger Information Retrieval

Passenger information aims at improving the user-friendliness of public transport systems while influencing passenger route choices to satisfy transit user’s travel requirements. The integration of transit information from multiple agencies is a major challenge in implementation of multi-modal passenger information systems. The problem of information sharing is further compounded by the multi-l...

متن کامل

Image Retrieval: Content versus Context

In this paper, we introduce a new approach to image retrieval. This new approach takes the best from two worlds, combines image features (content) and words from collateral text (context) into one semantic space. Our approach uses Latent Semantic Indexing, a method that uses co-occurrence statistics to uncover hidden semantics. This paper shows how this method, that has proven successful in bot...

متن کامل

Exploiting Multimedia Content: a Machine Learning Based Approach

This thesis explores use of machine learning for multimedia content management involving single/multiple features, modalities and concepts. We introduce shape based feature for binary patterns and apply it for recognition and retrieval application in single and multiple feature based architecture. The multiple feature based recognition and retrieval frameworks are based on the theory of multipl...

متن کامل

Bayesian non-parametrics for multi-modal segmentation

Segmentation is a fundamental and core problem in computer vision research which has applications in many tasks, such as object recognition, content-based image retrieval, and semantic labelling. To partition the data into groups coherent in one or more characteristics such as semantic classes, is often a first step towards understanding the content of data. As information in the real world is ...

متن کامل

Multi-Modal Retrieval for Multimedia Digital Libraries: Issues, Architecture, and Mechanisms

Supporting effective and efficient retrieval of multimedia data is a challenging problem in building a digital library. In this paper, we examine the issues related to accommodating multi-modal retrieval of multimedia data (text, image, video and audio), and propose 2M2Net as a generic framework for such versatile retrieval in multimedia digital libraries. The retrieval is conducted based on th...

متن کامل

A Bag of Semantic Words Model for Medical Content-based Retrieval

The bag of visual words model has been widely used in contentbased image retrieval. However, when it is applied to medical domain, it potentially has several limitations, e.g., some ordinary feature descriptors may not be able to capture the subtle characteristics of medical images; there is a semantic gap between the low-level features and the medical concepts; the emerging multi-modal data po...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010